Mahoning County
LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions
Gupta, Gaurav Kumar, Pande, Pranal
Large Language Models (LLMs) are revolutionizing medical diagnostics by enhancing both disease classification and clinical decision-making. In this study, we evaluate the performance of two LLM- based diagnostic tools, DeepSeek R1 and O3 Mini, using a structured dataset of symptoms and diagnoses. We assessed their predictive accuracy at both the disease and category levels, as well as the reliability of their confidence scores. DeepSeek R1 achieved a disease-level accuracy of 76% and an overall accuracy of 82%, outperforming O3 Mini, which attained 72% and 75% respectively. Notably, DeepSeek R1 demonstrated exceptional performance in Mental Health, Neurological Disorders, and Oncology, where it reached 100% accuracy, while O3 Mini excelled in Autoimmune Disease classification with 100% accuracy. Both models, however, struggled with Respiratory Disease classification, recording accuracies of only 40% for DeepSeek R1 and 20% for O3 Mini. Additionally, the analysis of confidence scores revealed that DeepSeek R1 provided high-confidence predictions in 92% of cases, compared to 68% for O3 Mini. Ethical considerations regarding bias, model interpretability, and data privacy are also discussed to ensure the responsible integration of LLMs into clinical practice. Overall, our findings offer valuable insights into the strengths and limitations of LLM-based diagnostic systems and provide a roadmap for future enhancements in AI-driven healthcare.
Lessons from a human-in-the-loop machine learning approach for identifying vacant, abandoned, and deteriorated properties in Savannah, Georgia
Liang, Xiaofan, Brainerd, Brian, Hicks, Tara, Andris, Clio
Addressing strategies for managing vacant, abandoned, and deteriorated (VAD) properties is important for maintaining healthy communities. Yet, the process of identifying these properties can be difficult. Here, we create a human-in-the-loop machine learning (HITLML) model called VADecide and apply it to a parcel-level case study in Savannah, Georgia. The results show a higher prediction accuracy than was achieved when using a machine learning model without human input in the training. The HITLML approach also reveals differences between machine vs. human-generated results. Our findings contribute to knowledge about the advantages and challenges of HITLML in urban planning.
Digital Diagnostics: The Potential Of Large Language Models In Recognizing Symptoms Of Common Illnesses
Gupta, Gaurav Kumar, Singh, Aditi, Manikandan, Sijo Valayakkad, Ehtesham, Abul
The recent swift development of LLMs like GPT-4, Gemini, and GPT-3.5 offers a transformative opportunity in medicine and healthcare, especially in digital diagnostics. This study evaluates each model diagnostic abilities by interpreting a user symptoms and determining diagnoses that fit well with common illnesses, and it demonstrates how each of these models could significantly increase diagnostic accuracy and efficiency. Through a series of diagnostic prompts based on symptoms from medical databases, GPT-4 demonstrates higher diagnostic accuracy from its deep and complete history of training on medical data. Meanwhile, Gemini performs with high precision as a critical tool in disease triage, demonstrating its potential to be a reliable model when physicians are trying to make high-risk diagnoses. GPT-3.5, though slightly less advanced, is a good tool for medical diagnostics. This study highlights the need to study LLMs for healthcare and clinical practices with more care and attention, ensuring that any system utilizing LLMs promotes patient privacy and complies with health information privacy laws such as HIPAA compliance, as well as the social consequences that affect the varied individuals in complex healthcare contexts. This study marks the start of a larger future effort to study the various ways in which assigning ethical concerns to LLMs task of learning from human biases could unearth new ways to apply AI in complex medical settings.
A Language Model for Particle Tracking
Huang, Andris, Melkani, Yash, Calafiura, Paolo, Lazar, Alina, Murnane, Daniel Thomas, Pham, Minh-Tuan, Ju, Xiangyang
Particle tracking is crucial for almost all physics analysis programs at the Large Hadron Collider. Deep learning models are pervasively used in particle tracking related tasks. However, the current practice is to design and train one deep learning model for one task with supervised learning techniques. The trained models work well for tasks they are trained on but show no or little generalization capabilities. We propose to unify these models with a language model. In this paper, we present a tokenized detector representation that allows us to train a BERT model for particle tracking. The trained BERT model, namely TrackingBERT, offers latent detector module embedding that can be used for other tasks. This work represents the first step towards developing a foundational model for particle detector understanding.
The Creative Ways Teachers Are Using ChatGPT in the Classroom
Peter Paccone, a social studies teacher in San Marino, Calif., has a new teacher's aid helping him in the classroom this year. He plans to defer to his helper to explain some simpler topics to his class of high schoolers, like the technical aspects of how a cotton gin worked, in order to free up time for him to discuss more analytical concepts, like the effects of the first industrial revolution. "What I feel that I don't have to do any longer is cover all the content," Paccone told a group of more than 40 educators in a May Zoom workshop, which he organized. If artificial intelligence is on the cusp of reshaping entire aspects of our society--from healthcare to warfare--the first realm that leaps to many minds is education: Asked a question online, the ChatGPT chatbot will produce an answer that reads like an essay. So as students and teachers prepare for a new school year, they are also grappling with AI's implications for learning, homework, and integrity.
How Does Artificial Intelligence Work?
"Artificial intelligence is training computers on past history and letting it be aware of all that's out there, so that when you ask it a question it's going and looking at all of the things it has seen and giving you an answer based on that," Zerbonia says. The Business Journal Roundtable Series is sponsored by iSynergy.
Well-definedness of Physical Law Learning: The Uniqueness Problem
Scholl, Philipp, Bacho, Aras, Boche, Holger, Kutyniok, Gitta
Physical law learning is the ambiguous attempt at automating the derivation of governing equations with the use of machine learning techniques. The current literature focuses however solely on the development of methods to achieve this goal, and a theoretical foundation is at present missing. This paper shall thus serve as a first step to build a comprehensive theoretical framework for learning physical laws, aiming to provide reliability to according algorithms. One key problem consists in the fact that the governing equations might not be uniquely determined by the given data. We will study this problem in the common situation that a physical law is described by an ordinary or partial differential equation. For various different classes of differential equations, we provide both necessary and sufficient conditions for a function to uniquely determine the differential equation which is governing the phenomenon. We then use our results to devise numerical algorithms to determine whether a function solves a differential equation uniquely. Finally, we provide extensive numerical experiments showing that our algorithms in combination with common approaches for learning physical laws indeed allow to guarantee that a unique governing differential equation is learnt, without assuming any knowledge about the function, thereby ensuring reliability.
Remote Data Scientist openings near you -Updated October 19, 2022 - Remote Tech Jobs
The Data Scientist applies strong expertise in machine learning, data mining, and information retrieval to design, prototype, and build next generation advanced analytics engine and services. They collaborate with translators to define technical problem statement and hypothesis to test and develops efficient and accurate analytical models that mimic business decisions.
Edible Arrangements Made a Stunning Comeback. Then the Corporate Drama Spilled Into Public.
You could think of it as a rebirth. Or maybe it's one of those COVID-era glow-ups that had people emerging from isolation with straighter teeth and cuter clothes. Whatever you want to call it, Edible Arrangements is in the middle of a major transformation. A few years ago, the company that introduced the world to bouquets of skewered fruit was in freefall. Now, after a bunch of new product launches, one hired-and-fired CEO, and a pandemic, the company is boasting record-setting sales numbers and a renewed sense of self. It's even changed its name: The new Edible sells desserts and doodads of all kinds--not just fruit--and aims to be, as the CEO put it, "the Domino's of gifting." But like most extreme makeovers, Edible's has its detractors, specifically within its own ranks.
Performance of a Geometric Deep Learning Pipeline for HL-LHC Particle Tracking
Ju, Xiangyang, Murnane, Daniel, Calafiura, Paolo, Choma, Nicholas, Conlon, Sean, Farrell, Steve, Xu, Yaoyuan, Spiropulu, Maria, Vlimant, Jean-Roch, Aurisano, Adam, Hewes, V, Cerati, Giuseppe, Gray, Lindsey, Klijnsma, Thomas, Kowalkowski, Jim, Atkinson, Markus, Neubauer, Mark, DeZoort, Gage, Thais, Savannah, Chauhan, Aditi, Schuy, Alex, Hsu, Shih-Chieh, Ballow, Alex, Lazar, and Alina
The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX's tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, including DUNE Liquid Argon TPC and CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.